Goal-driven active learning
نویسندگان
چکیده
Abstract Deep reinforcement learning methods have achieved significant successes in complex decision-making problems. In fact, they traditionally rely on well-designed extrinsic rewards, which limits their applicability to many real-world tasks where rewards are naturally sparse. While cloning behaviors provided by an expert is a promising approach the exploration problem, from fixed set of demonstrations may be impracticable due lack state coverage or distribution mismatch—when learner’s goal deviates demonstrated behaviors. Besides, we interested how reach wide range goals same demonstrations. this work propose novel goal-conditioned method that leverages very small sets goal-driven massively accelerate process. Crucially, introduce concept active query demonstrator only hard-to-learn and uncertain regions space. We further present strategy for prioritizing sampling disagreement between policy maximized. evaluate our variety benchmark environments Mujoco domain. Experimental results show outperforms prior imitation approaches most terms efficiency average scores.
منابع مشابه
Learning as Goal-Driven Inference
types of learning activities. These are instantiated with domain-specific information in the context of the performance task to yield domain-specific learning goals which, in turn, are used to drive learning. 4 Goal Dependency Networks In general, an intelligent system will have multiple learning goals that are interrelated in a very complex manner. In order to reason about the interactions bet...
متن کاملTowards Goal-Driven Reflective Learning
Except for various ad hoc (and sometimes quite successful) systems, this de facto manifesto calling for the study of introspective systems did not give rise to what may be called "a general architecture for declarative and/or reflective machine learning". Some recent research taking place under the label of "goal-driven learning" signals however a renewed interest in these very basic issues: " ...
متن کاملLearning and Reusing Goal-Specific Policies for Goal-Driven Autonomy
In certain adversarial environments, reinforcement learning (RL) techniques require a prohibitively large number of episodes to learn a highperforming strategy for action selection. For example, Q-learning is particularly slow to learn a policy to win complex strategy games. We propose GRL, the first GDA system capable of learning and reusing goal-specific policies. GRL is a case-based goal-dri...
متن کاملIntegrated Learning for Goal-Driven Autonomy
Goal-driven autonomy (GDA) is a reflective model of goal reasoning that controls the focus of an agent’s planning activities by dynamically resolving unexpected discrepancies in the world state, which frequently arise when solving tasks in complex environments. GDA agents have performed well on such tasks by integrating methods for discrepancy recognition, explanation, goal formulation, and goa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Autonomous Agents and Multi-Agent Systems
سال: 2021
ISSN: ['1387-2532', '1573-7454']
DOI: https://doi.org/10.1007/s10458-021-09527-5